fix: make usage chunk in stream mode of gemini compatible with openai #1503
Conversation
Force-pushed from 3fc0bf1 to 35bff4c
Codecov Report

❌ Patch coverage is 61.90%.
❌ Your patch status has failed because the patch coverage (61.90%) is below the target coverage (80.00%). You can increase the patch coverage or adjust the target coverage.

@@            Coverage Diff             @@
##             main    #1503      +/-   ##
==========================================
- Coverage   83.26%   83.24%   -0.02%
==========================================
  Files         137      137
  Lines       12059    12069      +10
==========================================
+ Hits        10041    10047       +6
- Misses       1411     1413       +2
- Partials      607      609       +2

View full report in Codecov by Sentry.
/retest

@yuzisun can you take a look at this one?
Signed-off-by: yxia216 <[email protected]>
… a tool call (envoyproxy#1486)

**Description**
The finish reason should be tool_calls when the model returns a tool-call response. The Vertex API has no tool-call finish reason, so a workaround is needed to stay compatible.

---------
Signed-off-by: yxia216 <[email protected]>
Co-authored-by: Dan Sun <[email protected]>
Signed-off-by: yxia216 <[email protected]>
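As a rough illustration of the workaround described in that commit (all identifiers here are hypothetical, not the actual ai-gateway code), the translator can override the finish reason whenever the candidate carries a function call:

```go
// Sketch only: Vertex has no dedicated tool-call finish reason, so the
// translator derives the OpenAI finish_reason from the response shape.
func toOpenAIFinishReason(vertexReason string, hasFunctionCall bool) string {
	if hasFunctionCall {
		// Vertex reports a generic reason even for tool calls,
		// so override it to match OpenAI semantics.
		return "tool_calls"
	}
	switch vertexReason {
	case "MAX_TOKENS":
		return "length"
	default:
		return "stop"
	}
}
```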
…oxy#1491)

**Description**
This decouples the backendauth & headermutator packages from extproc specifics. As we are looking to migrate to dynamic modules, this is necessary refactoring work to make the code as reusable as possible.

**Related Issues/PRs (if applicable)**
Preliminary for envoyproxy#90

---------
Signed-off-by: Takeshi Yoneda <[email protected]>
Signed-off-by: yxia216 <[email protected]>
Signed-off-by: yxia216 <[email protected]>
Signed-off-by: yxia216 <[email protected]>
Signed-off-by: yxia216 <[email protected]>
Force-pushed from 66eaa8c to 30a3391
Signed-off-by: yxia216 <[email protected]>
/retest

1 similar comment

/retest

/retest

1 similar comment

/retest
}

if span != nil {
	span.RecordResponseChunk(openAIChunk)
This looks like a copy/paste error; should this be usageChunk?
updated! Thanks a lot for the comment!
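For context, a minimal sketch of the corrected flow being discussed (the emitChunk and buildUsageChunk helpers are hypothetical; only span.RecordResponseChunk and the chunk variables appear in the diff above): the usage-only chunk is recorded on the span as its own chunk, rather than re-recording the preceding content chunk.

```go
// Sketch of the fixed flow, assuming hypothetical helpers.
emitChunk(openAIChunk) // content chunk, carries finish_reason
if span != nil {
	span.RecordResponseChunk(openAIChunk)
}

usageChunk := buildUsageChunk(geminiUsageMetadata) // hypothetical helper
emitChunk(usageChunk)
if span != nil {
	span.RecordResponseChunk(usageChunk) // was openAIChunk before the fix
}
```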
Signed-off-by: yxia216 <[email protected]>
Description
Users found that simply using the "usage" information does not work for streaming responses of Gemini models.

This is because for OpenAI models, usage is reported in its own chunk: in an example response from gpt-4o, there is a finish_reason chunk, and then a separate usage chunk.
Thus, this change makes the Gemini stream translation compatible with OpenAI. (The Anthropic translation is already compatible.)
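Since the gpt-4o example above only survives as a screenshot, here is a self-contained sketch of the chunk ordering being matched; the struct definitions are trimmed-down stand-ins for the real types, and the token counts are made up. (In the OpenAI API, this final usage-only chunk appears when stream_options.include_usage is set.)

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Trimmed-down stand-ins for the OpenAI streaming types; field names
// follow the public chat-completions API.
type usage struct {
	PromptTokens     int `json:"prompt_tokens"`
	CompletionTokens int `json:"completion_tokens"`
	TotalTokens      int `json:"total_tokens"`
}

type choice struct {
	Index        int     `json:"index"`
	FinishReason *string `json:"finish_reason"`
}

type chunk struct {
	Object  string   `json:"object"`
	Choices []choice `json:"choices"`
	Usage   *usage   `json:"usage,omitempty"`
}

func main() {
	stop := "stop"
	// Last content-bearing chunk: carries finish_reason, no usage.
	finishChunk := chunk{
		Object:  "chat.completion.chunk",
		Choices: []choice{{Index: 0, FinishReason: &stop}},
	}
	// Final chunk: empty choices and usage only, matching gpt-4o.
	usageChunk := chunk{
		Object:  "chat.completion.chunk",
		Choices: []choice{},
		Usage:   &usage{PromptTokens: 24, CompletionTokens: 7, TotalTokens: 31},
	}
	for _, c := range []chunk{finishChunk, usageChunk} {
		b, _ := json.Marshal(c)
		fmt.Printf("data: %s\n\n", b)
	}
}
```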